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THE  ASYMPTOTIC  DISTRIBUTION 
OF  CERTAIN  CHARACTERISTIC 
ROOTS  AND  VECTORS 

T.  W.  ANDERSON 

COLUMBIA  UNIVERSITY 


1.  Introduction 

In  a  number  of  problems  in  multivariate  statistical  analysis  use  is  made  of 
characteristic  roots  and  vectors  of  one  sample  covariance  matrix  in  the  metric  of 
another.  If  A*  and  D*  are  the  sample  matrices,  we  are  interested  in  the  roots  </>*  of 
|  D*  —  <f>*A*  j  =  0  and  the  associated  vectors  satisfying  D*c*  =  <£*A*c*.  In 
the  cases  we  consider  A*  and  D*  have  independent  distributions.  Each  is  dis- 

Q 

tributed  like  a  sum  yay'a  where  yi,  .  .  .  ,  yq  are  independently  normally  dis- 

a3”! 

tributed  with  common  covariance  matrix.  In  the  case  of  A*  the  means  of  the  vec¬ 
tors  are  zero;  in  the  case  of  D*  the  means  may  not  be  zero.  We  are  interested  in 
the  asymptotic  distribution  of  the  characteristic  roots  and  vectors  when  the  num¬ 
ber  of  vectors  defining  A*  increases  indefinitely  and  when  the  means  of  the  vectors 
defining  D*  change  in  a  certain  way.  The  form  of  the  limiting  distribution  de¬ 
pends  on  the  multiplicity  of  the  roots  of  a  certain  determinantal  equation  involving 
the  parameters.  If  these  roots  are  simple  and  different  from  zero,  the  asymptotic 
distribution  is  joint  normal.  If  the  roots  are  not  simple,  the  asymptotic  distribu¬ 
tion  is  expressed  in  terms  of  “uniform  distributions”  on  orthogonal  matrices  and 
a  normal  distribution. 

We  shall  first  state  our  problem  in  a  general  form  and  show  in  what  kinds  of 
statistical  problems  there  is  interest  in  these  characteristic  roots  and  vectors. 
Suppose1  xa(a  =  1,  .  .  .  ,  iVj  of  p  components  is  normally  distributed  independent¬ 
ly  of  xp(a  5*  P)  with  mean 

(1.1)  £Xa  =  BiZlo  -\-H2Z2a 

and  covariance 

(1.2)  £  (.Xa  —  £Xa)  (Xa—  £ Xa)  '  =£  , 

where  zia  and  z2a  are  vectors  of  fixed  variates  of  qi  and  q2  components,  respec¬ 
tively,  and  Bi  and  B2  are  p  X  qi  and  p  X  qi  matrices,  respectively..  We  shall  use 
the  notation  N{B\Z\a  +  B2z2a,  2)  for  the  distribution  of  xa. 

Most  of  the  research  contained  in  this  paper  was  done  while  the  author  was  Fellow  of  the  John 
Simon  Guggenheim  Memorial  Foundation  (at  the  Institute  of  Mathematical  Statistics,  Univer¬ 
sity  of  Stockholm,  and  the  Department  of  Applied  Economics,  University  of  Cambridge).  The 
work  was  sponsored  in  part  by  the  Office  of  Naval  Research. 

1  Unless  specifically  indicated  otherwise,  a  vector  is  a  column  vector;  a  prime  indicates  the 
transpose  of  a  vector  or  matrix.  Vectors  and  matrices  are  indicated  by  bold  face  type. 
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On  the  basis  of  a  sample  (*i,  zn,  221),  .  .  .  >  ( Xn ,  Ziv,  Zm)  the  usual  estimate 
of  B  =  (BiB2)  is 

o-=l  v  a  =  l  ' 

N 

where  z'a  =  (z'ia  z'2a)  and  z_  z’_  is  assumed  to  be  nonsingular.  The  columns 

a=  1 

of  B,  say  bu,  are  normally  distributed  with  means  ($«,  the  corresponding  columns 
of  B,  and  covariance 

(1.4)  e  (bu  -  3.)  (i,  -  3.) '  =  (2  *xY's  ’ 

\  a-l  /«» 


where  ^  za  z'^  indicates  the  element  of  the  inverse  matrix  in  the  w-th 
and  w-th  column. 


row 


(i>0“ 


consisting  of  the  last  qi  rows  and  <72 


Let  Q~l  be  the  submatrix  of  I 
columns;  this  is  also  given  by 

(1.5)  <?=22!.2L-22!X(2ZiX)  '22iX- 

a= 1  a“l  o”=l  '  a=l 


Then  (fi2  —  B2)@(fl2  —  B2)'  has  a  Wishart  distribution  with  covariance  matrix  £ 
and  <72  degrees  of  freedom,  denoted  by  fT(£,  g2).  If  g2  <  p,  this  distribution,  called 

«» 

singular,  is  the  distribution  of  ^  y^yi  where  yu  is  distributed  according  to 

u-l 

N(0,  £)  independently  of  yv  («^  v).  The  usual  estimate  of  £  is 

(1.6)  .1»=£  (x.-Bzu)(Xu-BZu)’=jrxX-Bjtz.<B' 

a=l  a=l  0  =  1 


divided  by  N  —  (qi  +  <72).  This  matrix  is  distributed  according  to  TT(£, 
N  —  qi  —  qi)  independently  of  B. 

Many  statistical  problems,  for  example  [3],  [6],  involve  the  roots  of 

(1.7)  \B2QB'2-<f>*A*\  =0, 


or  the  vectors,  for  example  [1],  [2],  satisfying 

(1.8)  ( B2QB’2-4>*A *)  c*  =  0  . 

The  p  algebraically  independent  vectors  c0  satisfying  (1.8)  may  be  normalized  by 

(1.9)  '  c*g'A*c'h  =  n8gh, 

where  n  =  N  —  <71  —  <72  and  8go  =  1  and  8gh  =  0  for  g  7*  h.  We  say  that  the 
solutions  of  (1.7)  and  (1.8)  are  the  “characteristic  roots  and  vectors  of  B2QB2  in 
the  metric  of  A*.”  If  we  wish  to  test  the  hypothesis  that  the  rank  of  B2  is  r  against 
the  alternatives  that  it  is  greater  than  r  we  use  the  p  —  r  smallest  roots  of  (1.7). 
If  we  assume  that  the  rank  of  B2  is  r  and  we  wish  to  estimate  B2  (or,  equivalently, 
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estimate  the  linear  restrictions  on  B2)  we  make  use  of  the  vectors  c*  satisfying 
(1.8)  for  the  p  —  r  smallest  roots  of  (1.7). 

In  this  paper  we  shall  study  the  joint  asymptotic  distribution  of  the  roots  and 
vectors  defined  by  (1.7),  (1.8),  and  (1.9)  when  n  =  N  —  qi  —  q2—*  00  and 

1  N 

—  zaz'a  approaches  a  nonsingular  limit.  The  asymptotic  distribution  of  the 

fl 

a*  1 

roots  alone  has-been  given  by  Hsu  [8].  We  find  it  convenient  to  make  use  of  some 
of  the  results  in  [8]  to  obtain  the  joint  asymptotic  distribution  of  roots  and  vectors; 
however,  the  method  used  in  the  present  paper  could  be  used  independently  of  [8]. 
We  shall  assume  throughout  the  paper  that  q2  ^  P- 

2.  Reduction  of  the  problem  to  canonical  form 

To  simplify  the  following  derivations  we  shall  transform  the  matrices  B2QB'2 
and  A*  so  that  they  have  distributions  with  fewer  parameters.  Corresponding  to 

(1.7)  and  (1.8)  in  the  sample,  we  have  the  population  equations 


(2.1) 

=0 

and 

(2.2) 

(BlQnB-i  —  TaS)  Y  =  0 

whereQn  =  —  Q .  Let  the  roots  of  (2.1)  be  t\(ii)  ^  r\{n )  ^  ...  ^  r£(w)  ^  0.  The 
w 

number  of  zero  roots  is  the  difference  of  p  and  the  rank  of  B2  for  each  n  for  which 
Qn  is  nonsingular  (in  particular  for  n  sufficiently  large).  Let  yi (»),...  ,  y p(n)  be  a 
set  of  corresponding  solutions  of  (2.2)  satisfying 

(2.3)  y  ;(i*)Xy*00  =  V 

Let  rn  =  [yi(«),  .  .  .  ,  yP(n) ].  Then  we  can  make  a  transformation,  for  example 
[7],  so  that  A*  is  replaced  by 

n 

(2.4) 

(3  =  1 

where  tjp  is  distributed  according  to  N( 0,  /)  independently  of  y*  (jS  5^  a),  and 
B2  QnB't  is  replaced  by 

(2.5)  ^=Dc(»)jrw 

where  y**(«)  is  distributed  independently  of  yt*(n)  ( g  5*  h)  according  to 
N[\/ttT0 (n)z0,  /]  where  T„(n)  is  the  nonnegative  square  root  of  r\{n)  and  ta  is  a 
vector  with  all  components  0  except  the  g-th  (for  g  £  p)  which  is  1 .  The  roots  of 

(1.7)  are  the  roots  of 

(2.6)  |  Dn  —  <i>An  |=0, 

and  the  vectors  satisfying  (1.8)  and  (1.7)  are  related  to  the  vectors  Ci(w),  .  .  .  , 
cp(n )  satisfying 

(2.7) 


(Dn-M:)c  =  0 
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and 

(2-8)  c'ACK  =  nhh 

by 

(2-9)  C*(M)=rnCfl- 

It  should  be  observed  that  rn  and  r\(n)  depend  on  n  because  Qn  depends  on  n. 

We  shall  first  find  the  limiting  distribution  of  c0(n )  and  fa{n)  asw->  ®  (that 
is,  as  N  —>  oo ).  Let  yg  =  y**(n)  —  %/nTg(n)zg.  Then 


(2.10)  Dn=  ^  [yg  +  VnTg(n)zg ]  [yg-\-  Vnrg  {n)zg] ' , 

(7  =  1 

and  yg  is  distributed  according  to  iV(0, /). 

Let  C„  =  [ci(»),  .  .  .  ,  cp(n)\.  Then  (2.7)  can  be  written 

(2.11)  DC*  =  AC*Q>  , 

N/  n  n  n  n  n  1 


where  4»„  =  [fa(n)&ij]  and  fa(n),  .  .  .  ,  (f>p(n)  are  the  roots  of  (2.6)  and  (2.8)  can 
be  written 


(2.12) 

If 

(2.13) 
we  have 

(2.14) 


C'AC  =  nl . 

n  n  n 


X  =C~l , 


A=X'Xn1 


(2.15) 


D  =X'*J>  X  . 

n  n  n  n 


We  shall  set  out  to  find  the  limiting  distribution  of  and  Xn  for  t„(»)  approach¬ 
ing  limits  as  n  — »  °o .  To  make  <!>„  and  Xn  unique  we  require  <t>i{n)  >  fa (n)  >  .  .  . 
>  fa{n)  and  xa{n)  >  0.  The  probability  is  0  of  a  D„  and  An  for  which  Xn  and  <E»„ 
are  not  uniquely  defined. 

Throughout  this  paper  we  shall  make  use  of  the  following  special  case  of  a 
theorem  of  Rubin  [9] : 

Rubin’s  Theorem:  Let  Fn(u )  be  the  cumulative  distribution  function  of  a  random 
vector  un .  Let  vn  be  a  ( vector  valued )  function  of  un,  vn  =  fn(un),  and  let  Gn(v)  be  the 
{induced)  distribution  of  vn.  Suppose  lim F„{u)  =  F  (u)  [in  every  continuity  point 

n  — > 00 

of  F(u)]  and  suppose  for  every  continuity  point  u  of  f{u),  lim  /„  (u„)  =  /  (u) ,  when 

n  — >  00 

lim  un  =  u  .  Let  G(v )  be  the  distribution  of  the  random  vector  v  =  f(u),  where  u  has 


n  — ►  w 

the  distribution  F{u).  If  the  probability  of  the  set  of  discontinuities  of  f(u)  in  terms  of 

F(u)  is  0,  then 2  /  , 

limGn  ( v )  —G{v). 


n — »oo 


2  We  could  justify  the  limiting  procedures  by  another  method  that  consists  of  extending  a 
theorem  of  L.  C.  Young  (“Limits  of  Stieltjes  integrals,”  Jour.  London  Math.  Soc.,  Vol.  9  [1934],  pp. 
119-126),  concerning  the  limit  of  fgn(u)dFn(u),  applying  this  to  the  characteristic  function  of 
fn(un),  and  thus  obtaining  a  restricted  form  of  Rubin’s  theorem. 
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In  our  case  the  components  of  un  are  linear  combinations  of  the  components 
of  the  matrices  An  and  Dn;  the  components  of  vn  are  linear  combinations  of  the 
characteristic  roots  and  the  components  of  the  characteristic  vectors.  The  distri¬ 
bution  of  un  approaches  a  limit  and  the  function  /„(u)  approaches  a  limit  (in  the 
above  sense).  We  shall  verify  that  the  discontinuities  of  the  limiting  function  are 
of  limiting  probability  zero.  Thus  we  can  deduce  the  asymptotic  distribution  of  the 
characteristic  roots  and  (normalized)  vectors  by  using  the  asymptotic  distribu¬ 
tion  of  An  and  D„  and  the  limiting  function. 

3.  Derivation  of  two  special  distributions 

In  order  to  derive  the  desired  asymptotic  distributions  we  need  to  obtain  the 
distributions  of  the  characteristic  roots  and  vectors  (in  the  metric  of  /)  of  a  sym¬ 
metric  matrix  B  in  two  special  cases.  Let  the  roots  of 

(3.1)  \B  —  pl\  =  0 

be  ^  \f/2  ^  ^  pp.  Let  the  characteristic  vector  satisfying 

(3.2)  Bh  =  tih  , 

and  h'h  —  1  be  h,  (i  =  1,  .  .  .  ,  p ).  If  pi,  ...  ,  pp  are  different  hi,  .  .  .  ,  hp  are 
uniquely  defined  except  for  multiplication  of  a  vector  by  —1,  and  h[hj  —  0, 
i  9^  j.  Let  H  =  (hi,  .  .  .  ,  hp).  Then  BH  =  WV,  where  W  =  (pAj).  Let  H'  =  G. 
Then  G  satisfies 

(3.3)  G'WG  =  B  , 

(3.4)  G'G  =  I . 

These  equations  define  W  and  G  uniquely  if  we  require  ga  ^  0  except  for  a  set 
of  B  of  measure  zero.  Since  it  is  trivial  to  obtain  the  distribution  of  H  and  'F 
from  that  of  G  and  'F,  we  shall  now  obtain  the  distribution  of  G  and  *F. 

First  we  consider  the  case  that  the  distribution  of  B  is  W  (/,  m)  (m  2:  p);  that 
is,  the  density  is 

(3.5)  C(m,  P)  |tf|  ("•-p-1)/2g-trB/2 
where 

(3.6)  4  p) 

»= 1 

and  “tr”  denotes  trace.  This  is  the  distribution  of 

m 

(3.7)  5=2u/“/’ 

/= 1 

where  Ui,  .  .  .  ,  Um  are  independently  distributed  according  to  iV(0,  I). 

Theorem  1.  Let  B  have  the  distribution  W (/,  m).  Then  G  and  *F,  defined  by  (3.3), 
(3.4),  the  restriction  that  *F  is  diagonal  with  diagonal  elements  in  descending  order 
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and  gn  ^  0,  are  independently  distributed.  The  density  of  the  diagonal  elements  of  is 


(3.8) 


7 rP/2 


( m—p— 1)/2  -  V  4>j/2 

e  i=i  1 


2'-/*n(r[i(*»  +  i-»))r[l(#+i-»)l ) 


v 

*n 

i=  1 


n 

j  =  i+l 


/or  pi  ^  pp  >  0  w  0  elsewhere.  The  distribution  of  G  is  “ uniform  .” 

Proof.  That  the  marginal  density  of  pi,  .  .  .  ,  pp  is  (3.8)  has  been  proved  by 
Hsu  [5].  It  remains  to  show  that  G  is  distributed  independently  of  T  and  “uni¬ 
formly.”  The  “uniform”  distribution  of  all  orthogonal  ^-dimensional  matrices  is 
given  by  the  (normalized)  Haar  measure  on  the  orthogonal  group;  that  is,  the 
(normalized)  Haar  measure  is  the  only  probability  measure  on  the  group  that  is 
invariant  under  the  group  operation  on  the  right  [4].  Since  we  require  ga  ^  0, 
our  definition  of  “uniform  distribution”  is  the  conditional  distribution  obtained 
from  the  Haar  measure  by  requiring  ga  ^  0.  For  this  part  of  the  space  the  prob¬ 
ability  measure  is  2P  times  the  normalized  Haar  measure. 

The  measure  on  the  space  of  u/  defines  a  measure  on  the  space  of  G,  ga  ^  0. 
Consider  any  measurable  set  H  in  the  space  of  all  orthogonal  matrices.  Let  the 
diagonal  matrices  with  diagonal  elements  +  1  and  —  1  be  Ji,  ...  ,  J??.  Let  H  — 

2  p 

Hi ,  where  JiHi  is  a  set  in  the  space  of  G,  ga  ^  0.  Define  the  measure  of  H  as 

t=i 

the  sum  of  the  measures  of  JiHi.  Now  let  us  show  that  this  measure  is  invariant 
with  respect  to  multiplication  on  the  right.  Let  Ei  be  the  set  in  the  space  of  u / 
that  maps  into  JiHi.  Let  H*  be  HP;  that  is,  II*  is  the  set  obtained  by  multiply¬ 
ing  each  element  of  H  on  the  right  by  the  orthogonal  matrix  P.  Then  H*  = 

^2  =  ^2  ^  J*  •  We  now  show  that  the  measure  of  H *  is  the  same  as  Hi.  Let 

H*  —  ^2  such  that  JjH*j  is  in  the  space  ga  ^  0.  Let  be  the  set  in  the 
i 

space  uf  that  maps  into  JiH*j.  Then  ^  E*.  =  P'Ej ;  that  is,  is  the  set 

i 

obtained  by  multiplying  each  (ui,  .  .  .  ,  Um)  by  P'  on  the  left.  The  measure  of 
P'Ei  is  the  integral  of  the  density  of  P'ui,  .  .  .  ,  P'um  over  Since  the  density 
of  P'ui,  .  .  .  ,  P'um  is  the  same  as  that  of  ui,  .  .  •  ,  um,  the  measure  of  P'Ei  is 
that  of  Ei.  Thus  the  measure  of  H*  is  that  of  H.  This  proves  that  the  measure  is 
invariant  with  regard  to  the  group  operation  on  the  right.  Since  there  is  only  one 
such  measure  on  the  group  of  orthogonal  matrices  with  total  measure  2P,  this  is  it. 
The  joint  distribution  of  <I»  and  G(ga  ^  0)  has  a  density.  This  density  does  not 
depend  on  G  because  the  density  at  and  G  is  the  same  as  at  «I>  and  G*  since  G* 
can  be  obtained  from  G  by  multiplication  on  the  right  by  some  orthogonal  matrix  P 
and  this  is  equivalent  to  transforming  B  to  P'BP  which  has  the  same  characteristic 
roots  as  B.  This  proves  the  theorem. 

Now  suppose  the  density  function  of  B  =  B'  is 

(3.9)  ir-p(p+l)/42-p/2g-trBV2 


CHARACTERISTIC  ROOTS  AND  VECTORS 


IO9 


that  is,  bn  (i  =  1,  .  .  .  ,  p\j  =  i,  i  +  1,  .  .  .  ,  p)  are  independently  and  normally 
distributed  with  means  zero;  the  variance  of  bu  is  1  and  that  of  bn  ( i  <  j)  is  \. 
Now  define  G  and  VF  (diagonal)  by  (3.3)  and  (3.4)  with  the  understanding  that 
the  elements  of  the  first  column  of  G  are  nonnegative.  The  ordered  roots  pi  are  not 
restricted  to  being  nonnegative. 

Theorem  2.  Let  the  symmetric  matrix  B  have  the  distribution  with  density  (3.9). 
Then  G  and  V,  defined  by  (3.3),  (3.4),  the  restriction  that  W  is  diagonal  with  diagonal 
elements  in  descending  order  and  g,i  ^  0,  are  independently  distributed.  The  density 
of  the  diagonal  elements  of  T  is 

(3.10)  2_p/!l  fir[jo+i  -*)  j  rvi.^n  fi  (*<-w 

t=l  i—  1  7  =  i+l 

for  =  •  •  •  =  and  0  elsewhere.  The  distribution  of  the  orthogonal  matrix  G  is 
uniform. 

Proof.  The  proof  that  the  marginal  density  of  pi,  .  .  .  ,  pP  is  (3.10)  has  been 
given  by  Hsu  [8].  The  remainder  of  the  proof  is  the  same  as  for  theorem  1  since 
the  density  of  P'BP  for  P  orthogonal  is  the  same  as  B. 


4.  An  asymptotic  distribution  when  all  population  roots  are  zero 

A  simple  case  of  our  main  problem  is  the  case  where  t„(w)  =  0  for  all  g  and  n. 
Then  Dn  =  D  has  a  Wishart  distribution  with  ^  p)  degrees  of  freedom  which 
does  not  depend  on  n.  In  this  section  we  shall  find  the  asymptotic  distribution 
of  Xn  and  <!>„  in  this  special  case. 

In  all  of  the  asymptotic  theory  we  use  the  result  [8]  that  as  n—*co 

(4.1)  tf„=  +  Mn-»/) 

is  asymptotically  normally  distributed  with  mean  zero.  The  functionally  inde¬ 
pendent  variables  are  statistically  asymptotically  independent  and  the  variances 
are  given  by 

(4.2)  £u\i  =  2,  £#!/=!,  i^j. 


The  matrices  Xn  and  <I>„  are  defined  by 


(4.3) 


- D=X'<PX  . 

/y^  1%  n  n  7 


(4.4) 


—  A  =X'X 

yi  n  n  n  7 


where  xlfin)  ^  0,  <!»„  is  diagonal  and  the  diagonal  elements  of  <I»n  are  labelled  in 
descending  order.  For  each  n,  Xn  and  4>n  are  defined  uniquely  except  on  a  set  of 
probability  zero. 

As  n  — *  co ,  —  An  approaches  the  stochastic  limit  I  and  —  D  approaches  the  sto¬ 
chastic  limit  0.  In  the  limit  Xn  must  satisfy 
(4.5)  /  =  X'X , 


and  4»„  must  approach  0  stochastically. 


no 
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To  obtain  the  full  asymptotic  theory  we  define  new  matrices  Wn,  Zn  and  ®„. 
For  any  matrix  Xn  we  have  an  orthogonal  matrix  0„  and  a  diagonal  matrix  A„ 
defined  by 

(4.6)  X'nXn  =  OnAnOn  , 

where  the  diagonal  elements  of  A  are  ordered  in  descending  size  and  o,\(n)  ^  0. 
Let 

(4.7)  G  =  O'A^O  , 

v  9  n  n  n  n  7 

where  the  elements  of  Ai/2  are  the  positive  square  roots  of  the  corresponding  ele¬ 
ments  of  A„  (the  roots  are  different  from  0  when  X'nXn  is  nonsingular).  Let 

(4.8)  Wn=XnG~1. 

This  is  an  orthogonal  matrix;  that  is, 

(4.9)  W„Wn  =  I  ■ 

Let 

(4.10)  /.=  V5W.(G.-/). 

Then 

(4.11)  Xn  =  WnG„  =  Wn  (/+  4)  =  W,  +  ~=Z„. 

We  notice  that 

(4.12)  W'nZn  =  Z'nWn  , 
because 

(4.13)  WX=  VnW'Wn  ( Gn-I)  =  Gn-/=  Gn-Ir  =  Vn  ( Gn-I)  W>fl=Z>n  . 

Now  let  us  show  that  (4.9),  (4.11),  and  (4.12)  define  Wn  and  Zn  in  terms  of  Xn 
(except  for  a  set  of  measure  0).  We  have 

(4.14)  X=WOXy20  . 

Let  W*  be  another  matrix  satisfying  (4.9),  (4.11)  and  (4.12),  with  possibly  a 
different  Zn.  Then 

(4.15)  W*'Xn  =  X'nW*  , 

(4.16)  X nW*'=W*X’n. 

Equation  (4.15)  is 

(4.17)  W*'  W  nOXn2  On  =  0^  OnW'nW*  . 

From  this  we  derive 

(4.18)  OnW*'WnOX i/2  =  A i/2OnWnW*On  • 

Let 

(4.19)  OnW*'WnOn  =  o*  . 

Then 

(4.20) 


O* A£/2  =  A ln/20*'  . 
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The  component  equations  are 

(4.21)  oW-tY'oJi. 

This  gives  us 


(4.22) 

From  (4.16)  we  derive 

(4.23) 


sl/2 

*  * 

o*  = - 0 

17  §1/2  Uji 


^1/2 

*  v  j  * 
o*  =  — - —  o* 
ua  §i/2  ui i- 


If  bi  bj,  o*j  —  0.  Therefore,  if  the  bi  are  all  different 

(4.24)  0*=/, 


and 

(4.25)  W*  =  W . 

Therefore,  except  for  a  set  of  measure  zero  of  Xn,  (4.9),  (4.11),  and  (4.12)  define 
W„  and  Z„  uniquely.  Let 

(4.26)  ©n  =  n&„  . 


Now  let  us  substitute  into  (4.3)  and  (4.4).  We  obtain 

(4.27)  D-W',*.Wn  +  -^&*,Wn  +  W.*M  +^Zn0nZn, 

(4.28)  U»  =  W'Zn  +  Z’nW„  +  ZX  • 

Together  with  (4.9),  (4.12)  and 

(4.29)  wa  (n)  +^=  za(n)  ^  0 ,  i=.l,...,p, 

(4.27)  and  (4.28)  define  ©„,  Wn  and  Zn  uniquely  for  each  n. 

For  given  Wn  =  W,  Zn  =  Z  and  ©„  =  ©  the  limits  of  (4.27)  and  (4.28)  ex¬ 
pressing  D  and  U  in  terms  of  Wn,  Zh  and  ©„  are 

(4.30)  D=W'®W, 

(4.31)  U=  W'Z+  Z'W=  2W'Z. 

If 

(4.32)  Wf  i^O, 

and  6i  >  6,  for  i  >  j,  then  (4.9),  (4.12),  (4.30)  and  (4.31)  define  W,  Z ,  and  © 
uniquely  in  terms  of  D  and  U  (except  for  a  set  of  D  and  U  of  measure  0).  Now 
we  wish  to  argue  that  if  we  take  (4.9),  (4.12),  (4.27),  (4.28),  and  (4.29)  as  de¬ 
fining  Wn,  (diagonal)  ©„,  Zn  in  terms  of  (nonrandom)  D  =  Dn  and  Z7„,  the  limit 
of  Wn,  ©„  and  Zn  is  the  solution  of  (4.9),  (4.12),  (4.30),  (4.31),  and  (4.32)  as 
n  — ■>  °o  for  Dn—>D  and  Un—>U  where  D  and  V  are  such  that  the  solution  is 
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unique  (the  exceptional  D  and  i/are  of  measure  0).  A  diagonal  element  of  ®n  is  a 
root  of 

(4.33)  =0. 

As  n  — >  oo ,  this  root  approaches  the  root  of 

(4.34)  |  2)  —  6I\  =  0, 

and  this  is  an  element  of  ©  defined  by  (4.9)  and  (4.30).  Zn  is  defined  (equivalently) 
by 

(4.35)  Zn  =  VH(.Xn-Wn) 

where  the  diagonal  elements  of  An  are  roots  of 

(4.36)  -^Un+I-8I  =0. 

Let  tiin)  be  the  i-th  root  of 

(4.37)  \Un-*I\  =  0  . 

Then 

(4.38)  3i(»)  = 

Clearly 

(4.3  9)  lim  Vn  [  1  —  6“1/2  (w)  ]  =  \  lim^,  («)  =  . 

n — >  oo  n — »  co 

Since  X'nXn—*I  and  On  is  orthogonal,  each  element  of  Zn  is  bounded  in  the  limit. 
Thus  the  norm  (any  standard  norm)  of  Z'nZn  and  the  norm  of  (Zn®nWn  + 

tr'©nzn)  +  -  Z'n%nZn  go  to  zero  as  n  — ►  <» .  Thus  each  element  of  Dn  —  W^©nWn 
w 

and  each  element  of  Un  —  2W„Zn  goes  to  0  as  n  — »  » .  Consider  the  matrix  function 
(P,  Q)  —  (D  —  W*'&*W*,  U  —  2  W*'Z*),  where  IT*  and®*  satisfy  our  usual  condi¬ 
tions  including  (4.32).  The  inverse  functions  W*,  0*,  Z*  (as  functions  of  P  and  Q) 
are  continuous  in  the  proper  domain  (except  on  the  exceptional  set).  Hence,  if  the 
norm  of  ( P ,  Q)  is  sufficiently  small  the  norm  of  (W*  —  W,  0*  —  0,  Z*  —  Z) 
must  be  arbitrarily  small.  If  wa  >  0,  then  w*\  >  0  for  norm  of  (P,  Q )  sufficiently 
small.  Then  wa(n)  for  n  sufficiently  large  is  bounded  away  from  0,  and  for  n  suffi¬ 
ciently  large  Wn{n)  satisfying  (4.29)  must  satisfy  (4.32).  Thus  Wn,  ®n  and  Zn  con¬ 
verge  to  W,  0,  and  Z  defined  by  (4.30)  and  (4.31). 

The  limiting  equations  (4.30)  and  (4.31)  define  W,  Z ,  and  0  uniquely  except 
on  a  set  of  Lebesgue  measure  zero.  The  discontinuities  can  only  occur  on  this  set. 
Now  considering  D  and  Un  as  random  matrices  we  observe  that  the  limiting  distri¬ 
bution  of  D  and  Un  is  absolutely  continuous.  Thus  the  conditions  of  Rubin’s 
theorem  are  fulfilled.  To  obtain  the  limiting  distribution  of  the  random  matrices 
Wn,  Zn  and  ®n  defined  in  terms  of  the  random  matrices  D  and  Un  we  need  only 
find  the  distribution  of  W,  Z  and  0  defined  by  (4.9),  (4.12),  (4.30),  (4.31)  and 
(4.32),  where  U  has  the  limiting  distribution  of  Un- 

The  distribution  of  W  and  0  is  that  of  theorem  1.  The  conditional  distribution 
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of  Z  given  W  and  ©  is  obtained  from 

(4.40)  Z=\WU. 

Thus 

(4.41)  £{z\w)  =  W£U=  0. 

Let 

(4.42)  U  =  (ui, .  .  .  ,  up)  , 

Z  =  (zi,  .  .  .  ,  Zp)  , 

W  =  (wi,  .  .  .  ,wp) . 

Then 

(4.43)  =  i^(u,u;)lV. 

Since  =  2  and  £m|,-  —  1  for  i  ^  j>  and  ffuijUki  =  0  otherwise,  then 

(4.44)  £u{Ui  =  /  +  £*•»• , 

CUiUj  =  Ejt 

where  t,y  is  a  matrix  with  1  in  the  i-th  row  and  j- th  column  and  0’s  elsewhere. 
Thus 

(4.45)  ^{z,z,'|  W)  =  \W<fhii+zit)W' 

=  Whi  +  Wjw'i)  . 

Since  U  is  normally  distributed,  the  conditional  distribution  of  Z  is  normal. 

Theorem  3.  Let  D  have  the  distribution  W(1 ,  qj),  qi  ^  p,  and  let  An  be  inde¬ 
pendent  of  Dn  and  have  the  distribution  W(I,  n).  Define  Xn  and  <i>„  by  means  of  (4.3), 
(4.4)  and  the  conditions  that  Xa(n)  2s  0  and  <S>„  is  diagonal  with  diagonal  elements 

in  descending  order.  Let  «<!>„  =  ©n  and  let  Xn  —  Wn  -f-  ^n>  w^ere  WnWn  =  / 

and  WnZn  =  Z„Wn.  The  limiting  distribution  of  ©„,  Wn  and  Zn  as  »— >  0°  is  the 
joint  distribution  of  0,  W,  and  Z  such  that  the  marginal  distribution  of  the  diagonal 
matrix  ©  and  the  orthogonal  matrix  W  is  that  of  theorem  1  with  m  =  q%  and  the  con¬ 
ditional  distribution  of  Z  given  W  and  ©  is  normal  with  mean  0  and  covariances 
given  by  (4.45). 

5.  An  asymptotic  distribution  when  all  population  roots  are  equal  but  differ¬ 
ent  from  zero 

Another  special  case  that  is  easy  to  treat  is  the  case  of  all  roots  of  (2.1)  being 
equal  but  different  from  0,  say,  T\(n)  =  ...  =  r£(«)  =  An  >  0.  Then 

(5.1)  Dn=F+  VnE„  +  n\J, 

where 

h 

(s.2)  f=2>»v;. 

0“  1 

and  En  is  composed  of  elements 

(5.3)  VA»  (ya  +  ya) . 
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We  are  interested  in  Xn  and  4»„  (diagonal)  defined  by 

(5.4) 


-F  +  —rEn+\I  =  X' * X  , 

n  V n  "  n  n  n  n  > 


(5.5) 


—  A  —  X'X  , 

/m  n  n  n  7 


with  a;t-i(n)  ^  0  and  the  diagonal  elements  of  <I>n  arranged  in  descending  order. 
Let 

1 


(5.6) 


Xn  =  Wn 


Vn 


Xn, 


where  W„  and  Zn  satisfy  (4.9)  and  (4.12).  Let 


(5.7) 


=  Ktl  +  ' 


1 

Vw 


®n  , 


where  ®„  is  diagonal.  Then  (5.4)  and  (5.5)  are 

(5 • 8)  ^ = (w” + ^ 7-)' (u+ ^  e") ^ 7-) 

1 


=  \I- 


,-[X  {Z'W  +W'Z)  +W’®  w , 

\/fl  nnn1  n  n'  1  n  n  n * 


+  '-lKG,z,+K<\K  +  \Z'jn\  , 


(5.9) 
that  is, 

(5.10) 


X(w'+~kz*)  {w*+-jX 

-L=  (An-nD  =  (\v:zn+Z’Wj  +-^z;zt 


Multiply  (5.10)  by  X„  and  subtract  from  V«  times  (5.8)  to  obtain 
v^x»u» 


(5.11)  -j~F+En — 4=  x.  (A.  -  ni)  =  w_ &w„+ [ w: %z„ + z; e.H'. 


nnn 


Vw 


nnn'  n  n  nJ 


H - ^  QZ  * 

n  n  n  n 


Let 

(5.12) 


Vw 


C4„  -  w/) 


Then  (5.10)  and  (5.11)  can  be  written  as 

(5.13)  u„  =  (w;zn  +  z:wj  +-^%, 

(5.14)  -^F+En-\U,  =  W:e,Wn  +  ^[W:»nZn  +  Z'0,WJ  +,’ W,- 


where 

(5.15) 

(5.16) 


1T>„  =  /  , 
WX  =  ZnW„  , 
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(5.17)  wa  (n)  +  ^=  Zn  (»)  ^  0  , 

Thus  for  a  given  n,  0„,  Wn  and  Zn  are  defined  as  functions  of  Un,  F  =  Fn  and  En. 
The  functions  are  unique  and  continuous  except  over  a  set  of  U„,  Fn  and  En  of 
measure  zero.  The  limit  of  the  functions  (as  Un  —>  U,  Fn^>  F,  and  En  — >  E )  is  the 
solution  to 

(5.18)  U=  W'Z+  Z'W=  2W'Z  , 

(5.19)  E  —  \U  =  W'&W, 

with  W  satisfying  (4.32)  and  where  X  =  lim  X„.  This  argument  is  justified  as  in 

M — >00 

section  3.  In  particular,  each  diagonal  element  of  0n  as  a  function  of  nonrandom 
Fn,  En  and  Un  is  a  root  of 

(5.20)  | ^~= /•„+£;+  VnKI-  ( 0.+  V^I)(k  +  -~^  ()|  =°, 
that  is,  of 

(5'21)  l(v5,*+*--xA)(,+VSftr-«r  I  -°- 

Since 

(5- 2  2)  F- +E •  ~  KV-)  (/+ ^ ~k  u’T^e  -  w  ’ 

the  ordered  roots  of  (5.20)  approach  the  ordered  roots  of 

(5.23)  \E-\U-  6I\  =0. 

As  in  section  4  we  can  argue  that  the  elements  of  Wn,  ©«  and  Zn  are  bounded 
for  Fn—+F,  En— >  E  and  Un— >  U.  Then  the  elements  in  (5.13)  and  (5.14)  which 
are  multiplied  by  1/Vn,  l/n  and  1  /n3/z  go  to  zero.  The  remainder  of  the  argument 
of  section  4  applies. 

We  can  now  use  Rubin’s  theorem  since  the  discontinuities  of  the  mapping  oc¬ 
cur  where  there  is  indeterminacy  and  the  set  of  E  and  U  where  this  occurs  is  of 
limiting  probability  0.  To  find  the  limiting  distribution  of  the  random  matrices 
Wn,  ©n,  and  Zn  we  need  only  consider  the  distribution  of  the  random  matrices  W, 
0,  and  Z  defined  by  (5.18),  (5.19)  and  (4.32)  (with  W  orthogonal,  0  diagonal  with 
diagonal  elements  in  descending  order),  where  the  random  matrices  E  and  U  have 
the  limiting  distribution  of  En  and  Un.  Let 

(5.24)  E  -  XU  =  V. 

The  density  of  E  and  U  is 

(5.25)  £g-[tr£*/(4X)  +  tr£7V2)/2=  £  g  — tr{  (V+Al7)*  +  2M7*)/(8\) 

=  ke  -tr(V,  +  2XC/K+X*l7I  +  2Xl7J)/(8X) 

=  ke  -tr[<XJ  +  2X)(l7+[X/(X«  +  2X)]V)2+[2X/(X’  +  2X)]E>}/(8X) 
=  ke  -(X  +  2)tr(Z7+[l/(X  +  2)]V)J/8e  -trV,/(4X1  +  8X)< 

This  marginal  density  of  V  is  normal,  and  the  conditional  density  of  U  is  normal. 
The  distribution  of  W  and  (2X2  +  4X)_1/20  is  that  of  theorem  2.  The  conditional 
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distribution  of  Z  given  W  and  0  is  normal  with  mean 

(5.26)  £{Z\W,  =  = 

The  conditional  covariance  between  two  columns  of  Z  is 

(5.27)  e\  iz,-e(z,\w,  &))[zi-e(z,\w,  en'iwr,  @i 

=  2\T4  (/5«+WiW<)  • 

Theorem  4.  Let 

*  _  _  _ 

(5.28)  '  2>n=  2  (y<r+  y/n  V\tg)  (yQ+  Vn  y/\&,) ' 

9  =  1 

where  the  p-dimensional  vectors  yi,  ...  ,  yq2  (?2  ^  P)  are  independently  distributed 
according  to  N( 0,  /);  let  An  be  independently  distributed  according  to  W{1 ,  ra).  De¬ 
fine  Xn  and  by  means  of  (4.3),  (4.4),  and  the  conditions  that  Xa(n)  ^  0  and  On  is 
diagonal  with  diagonal  elements  in  descending  order.  Let  ©„  =  \/n(<Pn  —  \nI)  and 

X n  =  Wn  +  zn,  where  W'nWn  —  /,  and  W'nZn  =  Z'nWn-  The  limiting  distribu¬ 
tion  of  0„,  W  n  and  Zn  as  n—>  <»  is  the  joint  distribution  of  0,  W  and  Z  such  that 
the  marginal  distribution  of  (2X2  +  4X)-1/2  times  the  diagonal  elements  of  the  diagonal 
matrix  0  and  the  orthogonal  matrix  W  is  that  of  theorem  2  and  the  conditional  distri¬ 
bution  of  Z  given  W  and  0  is  normal  with  mean  (5.26)  and  covariances  (5.27). 


6.  An  asymptotic  distribution  in  the  general  case 

Now  we  consider  the  general  case  of  the  population  roots  having  different  values. 
We  assume  that  the  multiplicities  do  not  depend  on  n,  but  the  values  may.  Let 

(6.1)  [r2(»)  8ij]  =  (\An)I  0  ....  0  O' 

0  X2  (»)/...  .  0  0 

0  0  ....  \k(n)I  0 

.0  0  -  0  0. 


where 

(6.3) 
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As  before,  this  defines  Wi(n)  and  Zli(n)  uniquely  in  terms  of  Xu(n).  To  make  Xn 
unique  in  this  case  we  now  require  that  the  elements  of  the  first  column  of  Xa(n) 
be  nonnegative. 

Let 


(6.4) 


<t>n  = 


Xi  (»)/- 


Then 

(6.5) 

(6.6) 


yjn 

0 

0 
0 


@1  (n) 


x2  (»)/+— 7=@2(»). . . 


.  .\h(n)I+~^=®h  («) 


-  ©fc+l(«) 

n 


l«D'=lnF  +  ^lE"  +  X 


The  submatric  equations  of  (6.6)  are 

(6.7)  ^  Ua  («)  +/  =  (n)XM  ( n ) 

=  w:  ( n )  W,  (n)  +-~  [W'i  (»)  Z„(»)  +  Z'„(») W< (»)  ] 

+  i  y'z'ii(n)Zli(n). 

n 

1 

(6.8)  -^£7,,(«)  =  ^X'uWXhW 

=  -^lW',(n)ZiAn)  +Z'j,(n)  W,(n)  ] 

+  -^'.Zti(.n)Ztl(n) ,  »^j. 

«  * 

The  submatric  equations  of  (6.5)  are 

(6.9)  ^*\i  + -^7=  £.••  (”)  +M»)/=  ^X'ki(n)  **(») Xki(x) 

=  2(5fc*^(w)  +Vn^i(w))(Xfc(M)/+^  0fc(w)) 

X  ^  5*,^,-  (w)  +  -^7=  Zki  (w)  ^  ^  Z/1+1. i  (w)  0/t+i  (w)  Zh+i,i  (w) 

=  Xt  (n)  W\  (n)  Wi  (n)  +  —  {]&■  (»)  0,^,  (»)  +  X,-  (w)  X 
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X  [I V'iMZuW  +/« (»)ll',(»)]!  +  '[Vw»)2;,(»)4,W 

nlkZ i 

+  Wi  ( n )  &i  («)  Za  («)  +Z'ii  ( n )  0*  («)  (w)  ] 

+  jjiz^.Zki  («)  0*  («)  Z*t-  (»)  +  Zk+i  >t(n)  ®h+i(n)Zh+i,t(n) , 

i^h+l , 

(6.10)  -Fh+ i,h+i  =  ^^Xk,h+i(n)  <&k(n)Xk,h+i(n) 

n  k 

=  —  £  ^^Zk,h+ 1  («)  (»)/+"y^  Ok  (n)  ^  («)  +^Wa+i  («) 

+  ^k+i.k+i  (»)  (w)  ^H^+i  («)  Zh+i,h+i  (w) 

=  Wh+ i  («)  0A+i  («)  W^+i  («)  +  X*  (w)  Z*  >A+1  (n)  Z*,fc+i  (w)  1 

nl  kZ f  J 

i  r  A 

+^372  +^w  ®/»+i(w)^/»+i>a+i(w) 

+  Za+i,*+i(h)  0*+i(»)  Wa+i(»)  ]  +^4+1.IH-iW  0A+l(w)ZA+i,fc+i(«)  , 

(6.11)  +  £,-,  («)  -  £.&((,)•*  (»)•*«(«) 

=  V(Mrt(i*)  +  ^  Z'n (») ) (xt (»)/+ ©*(»)) 

X  f  SkjWj  (ft)  2ty  (»)  )  +  ^j  ^A+l,i  (»)  0A+I  (w)  ^A+x.y  («) 

=  -4=  [X,  (»)  W',(»)Zti(»)  +  \i(.H)Z'i,  (»)  W,(«)  1 

ir  ii 

+  -  52  X*(w)Z*,(»)Z*y(w)  +  Wi(n )  @i(»)Zty(»)+Zy,(w)  0y(») 
wLf^i 

XWy(«)]+^[t  Zfct  (w)  0k(»)Z*y(w)  J 

+  \z'h+i.i(n)Oh+i(n)Zh+i„  (n) ,  i  ;  i,  j?*h+l, 

ni 

1  1  A+1 

(6.12)  (»)  =  y^Xki  (n)  <Pk(n)Xk,k+i(n) 

n  y/n 

=  2  (  (*)  +  z'*>  («) )  (**(»>/+  ^  («)  Zt*+i(»)  + 
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+  V»4+1'i(w)«  ^”~\7 n  Zk+i>h+i(n)'^ 

=  ^^Xf  (.n)W'i(n)Zi,h+i{n)  +  ^W'i(n)  0,(«)ZllA+i(«)  +  ^  Xfe(n) 
'XZki(n)Zk,h+i(n)  J  H~  ^3/2  [±  Zkii.1t)  ®k{lt)Zk,h+l{lt)  “H  ^A+l.j(w) 

X  0a+i(»)  Wh+\{n)  jj  ^A+i>» (w)  ®a+i(w)^1+i,*+i(«)  ,  i  5^  h  +  1  . 

For  fixed  F,  En  =  E  and  Un  =  V  (in  the  proper  domain)  the  above  equations 
define  the  orthogonal  Wt(«),  Zt-(w)  and  the  diagonal  0t(»)  uniquely  (except  for  a 
set  F,  £  and  U  of  measure  zero)  under  the  restrictions  that  the  elements  in  the 

first  column  of  Wi  in)  +"^7^  Zu  in)  are  nonnegative  and  that  the  diagonal  ele¬ 
ments  of  the  0i(«)  are  in  descending  order.  Now  subtract  /  from  each  side  of  (6.7) 
and  \iin)l  from  each  side  of  (6.9)  and  multiply  (6.7),  (6.8),  (6.9),  (6.11)  and 

(6.12)  by  y/n  and  (6.10)  by  n  and  let  n  — >  00 .  Using  the  fact  that  Wt(w)  is  orthogo¬ 
nal  and  (6.3),  we  obtain  the  limiting  equations  [for  X,-(»)  — >  X,-,  £„—>£,  and 
Un-+V] 

(6.13)  Uu=  2  W-Za, 

(6.14)  Ua  =  W'Zt]  +  Z^Wj  ,  i  *  j  , 

(6.15)  Eu  =  W'iOiWi  +  ZKiW’iZu  ,  ij*h+  1  , 

h 

(6.16)  Ffc+i,*+i  —  Wh+l®h+1Wh  +  1+  X*Z*,*+iZfc,A+l  , 

*=1 

(6.17)  Ea  =  XiW'Zij  -f-  \jZjiWj ,  i  *  j ;  i,j^  h  +  1  , 

(6.18)  £a+1  =  XtW'Za+1 ,  i  5*  h  +  1  . 

From  (6.13)  and  (6.15)  we  obtain 

(6.19)  Eu  -  \iUu  =  WfrtWi . 

From  (6.16)  and  (6.18)  we  obtain 

h  1 

(6.20)  Fh+i,h+i~  —  Ej,h+iEj,h+i  =  W^+i0a4-i1F^-i-i . 

i^i  A* 

Then  the  fact  that  Wi  is  orthogonal  and  the  requirement  that  the  elements  of  the 
first  column  of  Wi  be  nonnegative  define  Wi,  Za,  and  0<  uniquely. 

To  show  that  Wi,  Za,  and  0,  defined  by  (6.13),  (6.14),  (6.17),  (6.18),  (6.19) 
and  (6.20)  are  the  limits  of  the  matrices  defined  by  (6.7)-(6.12)  is  more  compli¬ 
cated  than  the  similar  demonstration  in  section  4.  We  shall  only  sketch  this  proof 
briefly.  First  we  should  like  to  prove  that  the  diagonal  elements  of  0i(»),  for 
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example,  converge  to  the  characteristic  roots  of  £u  —  Xil/n  as  Fn — ►  E,  En—>E 
and  Un  — *  U.  From  the  equation 

(6'21)  |if.+v;£»+A„-«(^«1+/)|=0 

we  can  show  that  the  first  r\  elements  of  <I»„  converge  to  Xi.  Then  we  need  to  show 
that  the  largest  root  of  (6.21)  minus  Xi(«)  times  \/n  converges  to  the  largest  char¬ 
acteristic  root  of  En  —  Xit/n.  That  can  be  argued  from  the  determinantal  equa¬ 
tion 

(6'2  2)  1 1”' E" +  A" ~  (w V*  +/)  (Xl  <»>  '  +  8I)  \ 

In  the  second  determinant  above  we  can  factor  out  \/n  from  the  first  n  rows. 
Then  as  n  — »  a>  there  are  sequences  of  roots  each  of  which  converges  to  a  char¬ 
acteristic  root  of  En  —  Xit/n.  Similar  arguments  suffice  for  ®i(«)  (i  5^  h  +  1). 
For  ®A+i(»)  we  can  use  a  slightly  more  complicated  demonstration.3 

Next  we  wish  to  argue  that  the  elements  of  Za(n )  are  bounded  as  n  — »  °°  (as 
E„— » E,  E„— >  E,  and  Un  — >  V ).  For  convenience  let  us  take  the  case  of  r»  =  1. 
Then  the  characteristic  vector  say  C\(n)  associated  with  the  largest  root  0i  satisfies 

(6.23)  [if„+^(£.-X1(n)J/„]  +  [A.-)M(»)/]-~=»i(-^=  [/„+/)] C  =  0. 
Then  the  components  of  Ci(n)  are 

(6.24)  «„(*)  =i(»)[n  ld(»)  ~M»)] 

(6.25)  cu  (n)  =  k  (n) ki  (n) ,  i  ^  1  , 

where  k(n)  approaches  a  finite  limit  and  &,(n)  are  bounded.  Using  the  same  rea¬ 
soning  for  each  characteristic  vector  (assuming  r,  =  1)  we  show  that  Cn  is  a 
diagonal  matrix  with  bounded  elements  plus  1  /y/n  times  a  matrix  with  bounded 
elements.  Thus  Xn  =  C71  is  of  the  same  form.  Therefore,  y/nxa{n)  =  z,j(n ) 
( i  j )  are  bounded.  From 

(6.26)  un  (»)  +  1  =  2  x\i  ^ 

=  x2a  («)  +  ^  (») 

}*i 

we  see  that  y/n[  1  —  **,(»)]  -  —  uu(n)  +  y/nx\j(n)  is  bounded.  Thus  Zu(n )  = 
Vn[l  —  «:«(«)]  is  also  bounded.  If  rx  9*  1,  essentially  the  same  argument  can  be 
carried  out  in  terms  of  the  partitioned  matrices.  Thus  the  norms  of  matrices  such 
as  Uu(n)  —  2W[(n)Zii(n)  go  to  zero.  The  argument  of  section  4  shows  that  Zu(n) 
approaches  Zu,  etc. 

3  Since  these  arguments  are  similar  to  Hsu’s  [8],  it  is  unnecessary  to  go  into  more  detail. 
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We  are  now  in  a  position  to  apply  Rubin’s  theorem.  The  discontinuities  occur 
where  W ,  Z,  and  ©  are  not  defined  uniquely  and  the  measure  of  U,  F  and  E  where 
this  occurs  is  zero.  Let  the  limiting  distribution  of  the  random  matrices  Un,  F, 
and  En  be  the  distribution  of  the  random  matrices  U,  F>  and  E.  Then  the  limiting 
distribution  of  the  random  matrices  Wn  (orthogonal  diagonal  blocks),  Zn  and  ©„ 
(diagonal)  defined  by  (6.7)— (6. 12)  is  the  distribution  of  W  (orthogonal  diagonal 
blocks)  Z,  and  ©  (diagonal)  defined  in  terms  of  the  random  matrices  U,  F,  and  E  by 
(6.13),  (6.14),  (6.17)-(6.20).  The  distribution  of  Eu  —  \iUu,  i  j*  h  -+-  1,  is  that 
of  section  5.  Hence,  the  distribution  of  Wi  and  ©,  is  that  given  there.  Since 
Eu  —  \iUu  is  independent  of  Ejj  —  \}Ujj,  i  j,  the  matrices  Wi  and  ©t,  i=  1,  .  .  .  , 
h  are  independent.  The  conditional  distribution  of  Zu  given  Wi  is  also  that  of  sec¬ 
tion  5  with  X  =  X„  ©  =  ©,,  W  =  Wi  and  p  —  r ». 

Now,  consider  (6.20).  An  element  of  Eith+i  is  eak  =  VX.ytff  (rx  +  .  .  .  +  r,-i  + 
1  ^  g  ^  rx  +  .  .  .  +  r,;  p  —  rh+1  +  1  ^  k  ^  p) ;  an  element  of  Fh+i,h+i  is 

fkk'  =  2  yk«yk'°  • Thus  an  element  °f  Fh+i,h+i—  y^YE'i,h+iEi,h+i  isS  yk<>yk'o 
7^1  1  =  1  A*  0  =  1 

P~rA+l 

—  ykgyk'a  =  •  This  matrix  order  rh+\  has  the  distribution 

*=i  fl=p-Vn+1 

W(I,  q2  —  p  +  fh+i)  and  is  independent  of  Eu,  i  5*  h  +  1,  and  Eti,  i  j .  The 
distribution  of  Wh+ 1,  ©a+i  and  2*+wh-i  is  that  of  section  4  with  W  =  Wh+ 1,  ©  = 
©A+i,  Z  -  Zh+i.h+1,  p  =  r*+i  and  m  =  q2  -  p  +  x. 

The  matrices  17, y  and  Eu,  i  j*  j,  are  independent  of  the  other  submatrices  (ex¬ 
cept  Eu  =  E[j  and  Uj{  =  Z7,'y).  From  (6.14)  and  (6.17)  or  (6.18),  we  obtain 

(6.27)  Ea  -  \jUij  =  (X,-  -  Xy )WiZa  . 

The  conditional  distribution  of  7,y  given  W,  is  that  of 

(6.28)  r-i-r-  Wi  (Ea  -  Xy Ua) . 

hi  hj 

An  element  of  Ey  is 

(6.29)  e„f  =  V\jy0f+  V\iy/0  ,  rx-\- .  .  .  +  rx~x-\- 1  ^  g  ^  rx-\- .  .  .  +  r,- , 

+  ■  •  •  +  *7-i+  1  =  /  =  ri  +  •  •  •  +  fj  • 

Thus  an  element  of  E, y  —  XjUa  is  normally  distributed  independently  of  the  other 
elements  with  zero  mean  and  variance  X,  +  Xy  +  Xy.  The  conditional  distribution 
of  Zn  given  W,  is  normal  with  mean  0  and  covariance  between  two  elements  is 

(6.30)  £{  Zg/Zg'f'  |  W i)  =  tt  — - .  &  ^2wok  (gfc/  ~  XyM*;/)  w'g\{ek‘f'—\jUk,j') 

\A»  Ay;  ic.k' 

=  - m  wokwo'i'  5//'  (  Xi  +  Xy  +  Xy) 

\Aj  Ayy 

=ix5f^yi(Xi+x'+x')- 

Thus  the  variance  of  zaf  is  (Xt-  +  Xy  +  Xf)/(Xy  -  Xy)2  and  the  covariances  are  zero. 
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The  conditional  distribution  of  Za  is  similar  except  that  i  and  j  are  interchanged. 
Zij  and  Za  are  independent  of  the  other  matrices.  Now  consider  the  conditional 
covariance  between  an  element  zof  of  Zty  and  z0'/'  of  Za-  It  is 

(6.31)  £  {  zgf  Zg’f’  |  Wi,  Wj }  =  jj— yyi  £  2  <  ew  “  Xy*v) 

XyW/'&') 


Xy  H~  Xy-j-  XyXy 
(Xy-Xy)2 


8k/'  8fk’ 

k,k’ 


Xi  +  Xy  +  XyXy  (*)  (») 

WgfWg'f  . 


(Xy  Xy)  2 


We  can  indicate  the  conditional  covariances  between  the  columns  Z  = 
(zi,  .  .  .  ,  zp)  in  matrix  form.  If  g,  f  ^  ri,  the  conditional  covariance  between  z0 
and  Z/  is 
(6.32)  £\ 


zgz'f\0,W}  = 


f_L_ 

I  2\i  +  4 


(/  bgf  ) 


0 


0 


Xi  +  X2  +  X2 

(x^-  x2yT 


/5„/ 


0 

0 


0 


Xi 

M 


75„/ 


If  g  ^  ri  and  rx  +  1  ^  ^  fi  +  ^2,  the  conditional  covariance  is 


(6.33)  £{z„  z'f\0,  W) 


Theorem  5.  Let 


0  0  ...  0 
0  ...  0 

(X1-X2)2  f  0 


0  0  .  .  .  OJ 


(6.34) 


_ 

Dn=  ^  V^»7 •„  (»)ej  [y„+  \/«rs  (w)ej 

c  =  l 


ze;Aere  the  p-dimensional  vectors  yi,  .  .  .  ,  yQ2  (qt  ^  p)  are  independently  distributed 
according  to  N{ 0,  I )  and  rg(n )  =  v%( n)  — >  \/Xy,  n  +  .  .  .  +  r»_i  +  1  ^  g  ^ 
fi  +  .  .  .  +  ri,  Xy  >  Xy,  i  <j  (i,  j  =  1,  .  .  .  ,  h  +  1);  XA+i(»)  =  0.  Le/  A„  inde¬ 
pendently  distributed  according  to  W(I ,  n).  Define  Xn  and  4»n  by  means  of  (4.3),  (4.4) 
and  the  conditions  that  xg,ri  + ...  +  ri-i  +i(n)  ^  0,  ri  +  .  .  .  +  r»-i  +  1  ^  g  ^ 
ri  +  .  .  .  +  r,-,  and  <I>n  is  diagonal  with  diagonal  elements  in  descending  order.  Let 
each  of  Xn  and  On  be  partitioned  into  (h  +  l)2  submatrices  of  ri,  ...  ,  rA+1  rows  and 
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rh+ 1  columns.  Let  0n  be  defined  by  (6.4)  and  Wn  and  Zn  defined  by  (6.2), 
(6.3)  and  WfinYWfn)  =  I.  The  limiting  distribution  of  0n,  Wn  and  Zn  is  that 
of  0,  W  and  Z  similarly  partitioned ,  which  can  be  described  as  follows :  The  matrices 
0i,  Wit  Z  a  are  independent  of  Oj,  Wj,  Zjj  (i  7^  j) ;  the  distribution  of  0t,  Wi  and 
ZiU  i*h+  1,  is  that  of  0,  W  and  Z  given  in  theorem  4  with  p  =  rt;  the  distribution 
of  0a+i,  Wa+i  A+u+i  w  that  of  0,  W  and  Z  given  in  theorem  3  with  p  =  rh+ x, 
m.  =  q2  —  p  -j-  rh+\ ;  the  conditional  distribution  of  Z  given  0  and  W  is  normal;  the 
conditional  expectation  of  Zn  (i  7^  j)  is  0;  the  conditional  covariance  matrix  of  z„  and 
z/,  rx  -f-  .  .  .  +  ft-.  1  +  1  =  gf  f  ^  fi  +  •  •  •  +  fi,  consists  of  nondiagonal  blocks 
of  zeros  and  the  i-th  diagonal  block  is 

2  x  +  4  ^ 8°f  +wf)wol)  ) 


and  the  j-th  diagonal  block  (j  7^  i)  is 


Xt  4~  Xy 
(X,  —  \j) 2 


/5tf/ ; 


the  covariance  matrix  of  z„  and  zf  rx  +  .  .  .  -f  r,_ 1  +1  ^  g  ^  rx  +  .  .  .  +  fi, 
r\  -f-  .  .  .  +  rj- 1  +  1  ^  ^  ri  +  •  •  •  +  rj  (i  t*  j)  consists  of  OV  except  the  j-th  i-th 


block  which  is 


Xj  — (—  Xy4"  X*Xy 
(Xi  -  \f)^ 


W/j)W 


(»)' 

0 


7.  The  asymptotic  distribution  of  characteristic  roots  and  vectors 


The  columns  of  the  matrix  Cn  defined  in  section  2  are  the  characteristic  vectors 
of  Dn  in  the  metric  of  An.  We  choose  C„  so  that  the  elements  of  the  first  row  of  Cain) 
are  nonnegative.  Let 


(7.1) 


C„  = 


(Cn  (») 


C21  ( n ) 


C12  (») 

C22  («) 


Ci,h+i  (w) 
C2,fc  +  l  («) 


^Cfc+i.i  (n)  Cfc+1.2  («) 


Ca+1,^+1  (w) 


Tx(«) 

0 

0 

+  » 

Tu(«) 

F«(») 

.  rw+i(»)  ' 

0 

V2(n)  . 

0 

Vn 

F2i(») 

r22(«) 

Y2,h+i(n) 

.  0 

0 

.  n+i  («). 

-F*  +  i,i(w)  Ta+i,2(w) 

Yh+i,h+i(n)  > 

=l,”+ 


where 

(7.2) 

Vi(n)Vi(n)  =/, 

(7.3) 

tf(n)r«(»)  =  Y'fn)Vfn) 
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If  Xn  =  JnCn  1,  then  C  n  Cn  Jny  where  Jn  is  a  diagonal  matrix  with  diagonal  ele¬ 
ments  +  1  and  —  1.  Now  define  V„  and  Yn  in  terms  of  Cn  as  Vn  and  Yn  are  in  terms 
of  Cn.  First  let  us  show  that  for  nonrandom  Wn  — >  W,  Zn—>Z  (which  define 
Xn  ->  X),  Vn  ->  W'  and  Yn  -  Z' .  We  have 


(7.5) 


Cn  =  Vn 


r-  -  (w-  ■ +  V  ZT = (/+ V  w'V'w: 


1 

Vn 


WnZnWn  +  -{WnZn)*Wn 

n 


for  n  sufficiently  large.  The  i- th  diagonal  block  of  (7.5)  is 

(7.6)  F,(«)  +Vr"(n)  =W‘(n)  -VWl(n)Z"(n)W‘(n)  VTiAn) 

=  (#)  -  V  z'«  («)  +  ^  (»>  ■ 

where  the  elements  of  T„(«)  are  bounded.  Multiplying  each  side  of  (7.6)  on  the 
left  by  its  transpose,  we  obtain 

(7.7)  i+V  <B)  (B)  +r« (B)  ^ (B>  1  ?:<  (»)  («) 

^  iif,  (n)  (*)  +*«  (»)  w;  (»)  i  +~  sj(  <») . 

Subtracting  I  from  both  sides,  multiplying  by  7,(w)  and  using  (7.3)  and  (6.3), 
we  find 

(7.8)  Yu  («)  =  -  Vi  (»)  IF,  (»)  (*»)  +^=  F,  («)*,■  (») . 

We  can  show  (by  means  of  an  argument  similar  to  that  used  in  section  4)  that  the 
elements  of  Vl(n)Rl(n)  =  Su(n )  —  V i(n)Y'a{n)Y «(n)  and  of  /?,(«)  are  bounded. 
Inserting  (7.8)  in  (7.6)  we  obtain 

(7.9)  Vi  («)  =(tf;<«)  ~VZiiM  +V,<M)il+^WiMZ'“(n) 

+  (»))”' =  «<(»)  +  ±©<(«0 

for  »  sufficiently  large.  It  is  clear  from  this  that  7,(«)  — »  and  from  (7.8)  that 

Yu(n )  — >  —Z'a  (as  Wn  —■ ►  W  and  Z„  — >  Z).  A  nondiagonal  block  of  (7.5)  is 

(7.10)  =  -VW,MZii<-n)W'lM  VTii{n)- 

Clearly,  Ya(n)  ->  -  W-ZaW'. 
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Cn  and  Cn  are  different  (for  fixed  Wn  and  Zn )  only  in  that  columns  of  one  are 
multiplied  by  —  1  to  obtain  columns  of  the  other.  However,  for  n  large  enough  the 
sign  of  the  elements  of  the  first  row  of  C«(»)  are  those  of  Wi(n)  [=  V, •(«)],  which 
are  all  positive  (if  the  elements  are  different  from  0).  Thus  Cn  =  Cn  for  n  large 
enough.  Therefore,  for  nonrandom  Wn  — >  W  and  Zn  — >  Zf  V„  — *  W' ,  Yu(n)  — >  —  Z'u 
and  Yij(n)  — >  —  W'iZuWf  The  discontinuities  of  the  limiting  transformation  have 
limiting  probability  0.  Thus  the  limiting  distribution  of  the  random  matrices  0„F„ 
and  Yn  is  the  distribution  of  the  random  matrices  0,  V  —  W',  Y  =  —  W'ZW 
where  the  distribution  of  0,  W  and  Z  is  given  in  theorem  5. 

The  distribution  of  0  and  V'  =  W  has  been  given  explicitly.  From  the  condi¬ 
tional  distribution  of  Z  let  us  find  that  of  Y.  Consider  first  Yu,  i  9*  h  1 

(7.ii)  £{r„|e,ir)-  -gU»l  e,  w) -L-w1,*, 

= _ i _ 70 . 

2  (X,  +  2) 


The  covariance  between  two  elements  of  Yu,  say  yab  and  ya'b'  is 

1 


2  A,  +  4 


U 


s  ,  (i)  (*k 

066'  *T  vab'  Va'b)  ■ 


The  matrix  Fa+u+i  is  normal  with  mean  0  and  covariance  given  above  (for  X,  =  0). 

Now  consider  Fjy  =  —  W'iZijW-  and  Yu  =  —  WjZjlW'i.  The  joint  conditional 
distribution  is  normal  with  zero  means.  The  variance  of  the  elements  in  Yu  is 
(Xi  +  Xy  +  Xy ) j (Xj  —  Xy)2  and  that  of  the  elements  in  Fyy  is  (X,-  +  Xy  +  X?)/ 
(Xj  —  Xy)2.  The  covariance  between  elements  in  Fjy  is  0  as  are  those  between  ele¬ 
ments  in  Fyy.  The  covariance  between  an  element  yab  in  Yu  and  yav  in  Fy,  is 


Theorem  6.  Let 


Xy  ~l~  Xy-}~  XjXy 
(Xj  —  Xy)  2 


_  _ 

Dn  =  2  ^ n  t°  [y»  +  '> 

0  =  1 


where  the  p-dimensional  vectors  yi,  .  .  .  ,  yq2  (q*  ^  p)  are  independently  distributed 
according  to  N( 0,  I)  and  ra(n)  =  y/\i(n)  — >  -\/Xi,  f  1  +  •  •  •  +  fi-i  +  1  ^  g  ^ 
f  1  +  •  •  •  +  Xj  >  Xy,  i  <  j  (i,  j  =  1,  •  ■  ■  7  h  +  1);  X*+i(»)  =  0.  Ze/ 
pendently  distributed  according  to  W{I ,  n).  Define  Cn  and  On  fry  means  of  (2.11), 
(2.12)  awd  the  conditions  that  cri+  . . .  +r<-i  +  i,o(n)  =  0,  ri  +  •  •  •  +  r%- 1  +  1  = 
g  S  ri  +  .  .  .  +  rj  (i  =  1,  .  .  .  ,  h  -J-  1)  and  <t>„  is  diagonal  with  diagonal  elements 
in  descending  order.  Let  each  of  Cn  and  <I>n  be  partitioned  into  (h  -f-  l)2  submatrices 
of  ri,  ,  rh+ 1  rows  and  r\,  .  .  .  ,  rn+i  columns.  Let  0„  be  defined  by  (6.4)  and  Vn 

and  F„  be  defined  by  (7.1),  (7.2)  and  (7.3).  The  limiting  distribution  of  0„,  7„  and 
Yn  is  that  of  0,  V  and  Y  similarly  partitioned ,  which  can  be  described  as  follows :  The 
matrices  ©j,  7j,  Yu  are  independent  of  0y,  7y,  Fyy  (i  9*  j);  the  distribution  of  0j  and 
V[  is  that  of  ©  and  W  given  in  theorem  4  with  p  =  r,,  i  9*  h  +  1,  the  conditional 
distribution  of  Yu  given  0j  and  Vi  is  normal  with  mean  (2Xj  +  4)_l7j0j/  the  distri¬ 
bution  of  ©/,+!,  V'h+ 1  is  that  of  0  and  W  given  in  theorem  3  with  p  =  rh+i,  m  = 
q2  —  P  +  rh+ 1;  the  conditional  distribution  of  Yh+ 1,  6+1  is  normal  with  mean  0;  the 
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conditional  distribution  of  Y  given  0  and  V  is  normal;  the  conditional  expectation  of 
Yu ,  i  9*  j,  is  0;  the  conditional  covariance  matrix  of  ya  and  y'f,  +  . . .  -J-  r,_i  -j- 
1  ^  g,f  ^  r\  +  •  •  •  +  ri,  consists  of  nondiagonal  blocks  of  zeros  and  the  i-th  diagonal 
block  is 


1 

2X<  +  4 


(/  8gf  +  Vf  ^  Vg^  ) 


and  the  j-th  diagonal  block  ( j  9^  i)  is 


X,  +  Xy  +  Xy 
(X;  —  Xy)  2 


1 80f ; 


the  covariance  matrix  of  ya  and  y'f,  ri  +  .  .  .  +  *%- i  +1  ^  g  ^  +  •  •  •  +  ft,  r\ 

r,_ i  +1  ^  f  ^  ri  +  .  .  .  +  fy  (i  9*  j)  consists  of  O’s  except  the  j-th,  i-th  block 
which  is 


X  i  “l-  X  y  “j-  X  i  Xy 
“(Xi-Xy)” 


..OWi)' 

V/ 


Now  let  us  consider  the  asymptotic  distribution  of  C*  =  r „C„.  Let  r  satisfy 


(7.12)  B2  lim  Q„B:r  =  SrA  . 

n — >  00 

If  the  diagonal  elements  of  A  are  all  different  and  if  7iy  5*  0,  j  =  1,  .  .  .  ,  P,  then 
the  restrictions  7iy  >  0,  and  YySyy  =  1  determine  r  uniquely.  If  the  same  re¬ 
strictions  are  placed  on  each  rn  then  r„  — >  r  as  n  — *■  °°  because  the  set  of  char¬ 
acteristic  vectors  is  a  continuous  function  of  Qn  (which  approaches  the  limit, 

lim  (jn).  If  the  diagonal  elements  of  A  are  not  all  different,  then  another  in- 
*— *  00 

determinacy  is  involved.  Partition  T  in  the  manner  that  the  matrices  in  sec¬ 
tion  6  were  partitioned; 

(7.13)  r=  /rn  . . .  n.A+1  \ 


A+i.i 


A+l.fc+l/ 


Let  O  be  an  orthogonal  matrix  of  the  form 


(7.14) 


0  ...  .  0 
o2  . .  •  o 


'l 


1 0  0  ...  Oh+i ) 

We  require  that 

(7.15)  r'sr  =  /. 


Then  TO  also  satisfies  (7.15) ;  that  is,  there  is  an  indeterminacy  of  such  an  orthogo¬ 
nal  transformation.  This  indeterminacy  can  be  removed  by  putting  restrictions 
on  r  (such  as  requiring  that  the  first  column  of  r  lie  in  a  certain  r%  —  1  dimen¬ 
sional  hyperplane,  the  second  in  a  certain  r\  —  2  dimensional  hyperplane,  etc., 
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with  an  element  in  each  column  having  a  specified  sign).  For  all  n  greater  than  some 
particular  integer,  the  same  restrictions  can  be  imposed  on  rn.  With  these  restric¬ 
tions  imposed,  r„  — ►  r. 

Then 

(7.16) 

Let 

(7.17) 


C*  =  r  CL  =  r V 


l 


r T_. 


'n  n  n  n  n  '  ^  n  n 

v*  =  r  v 

V  n  A  n  "  n  > 


(7.18) 
then 

(7.19) 


y*  =  r  y  • 

**71  4  n-*  n  j 


V*  and  Y*  are  functions  of  Vn  and  Yn.  As  n  — »  <» ,  the  functions  approach  limits; 
that  is,  for  fixed  Vn=  V  and  Yn  =  F,  F*->  TF  =  V*,  say,  and  F*  — >  TF  =  F*, 
say.  Thus  by  Rubin’s  theorem  the  limiting  distribution  of  the  random  matrices 
K  and  Yt  is  that  of  TV  =  V*  and  TY  =  Y*. 

The  distribution  of  V*  is  that  of  TF.  The  distribution  of  0  is  given  in  section  6. 
The  conditional  distribution  of  Y*  given  V*  and  0  can  be  found  from  that  of  F. 
We  have 


,  4+ 1 

(7.20)  e (F*#| e,  V)  =  Xr..£in;| e.  v)  =r, 

*■=1 


1 


2  (A, +  2) 


Vt%l% 


e{Y*,K+ i\0,V}=O. 


j  9±h+  1  ; 


The  conditional  covariances  are  easily  obtained  from  theorem  6.  Let  y*  be  the 
a-th  column  of  F*.  Then  the  conditional  covariance  matrix  of  this  vector  for 
a  ^  ri  is 


(7.21) 


1 


2  Ax  -j-  4 


(f +»?»"') 


0 

Ai+  A2  +  A  2 
(A1-A2)2  ' 


0 


r' 


0 


0 


0 


1 


2  A,-  +  4 


r^z+viV0' 


1  yAt  +  Ay+A^  > 
}  (A,-  —  Ay) 2  ' 


Tu  ' 


where 

(7.22) 


r,= 
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If  a  a*  and  a,  a*  ^  ri}  the  conditional  covariances  between  ya  and  ya*  are 
(7.23) 


1  r^i^rl. 


2Xi  +  4 

If  a  ^  r\  and  r\  +  1  ^  a*  ^  r\  -f-  r2,  then  the  covariances  are 


(7.24) 


0  0  .  .  .  0 
'X1+X2+X1X2  (2)  (l)'  n  n 

Va*  Va  U  .  .  .  U 


(X!-X2)2 

0 


0  .  .  .  0 


0  ...  0; 


r' 


Xi  +  X2  +  XiX2 .,(2)  ,,(1)  v 

(x.-W  TlV - v°  r" 


Theorem  7.  Let  the  p-dimensional  vectors  xa(a  =  1 ,  ...  N  ^  p)  be  independently 
distributed  according  to  N (BiZia  -j-  B2z2a,  £)>  where  the  matrix  of  vectors  (z'ia,  z2a) 
is  of  rank  p  and  zia  has  qi  components  and  z2a  has  q2  components.  Let  Bn  = 

[B\(n)  ,B2{n)\  be  defined  by  (1.3);  let  Qn  =  —  Qn  be  defined  by  (1.5)  where  n  = 

ft 

N  —  q\  —  q2;  let  At  be  defined  by  (1.6).  Let  On  be  a  diagonal  matrix  whose  diagonal 
elements  are  the  roots  of  (1.7)  arranged  in  descending  order  of  size.  Let  C£  =  [c?(w), 
.  .  .  ,  c%(n)\  be  composed  of  the  corresponding  vectors  satisfying  (1.8)  and  (1.9).  Let 
r2(w)[r2(«)  Sr  ...  Sr  Tp(«)]  be  the  roots  of  (2.1).  We  assume  Qn  approaches  a  non¬ 
singular  limit  in  such  a  way  that  rg(n)  —  y/Xfn)  — >  v%,  r\  +  .  .  .  +  r,_i  +  1  S 
g  25  r\  +  .  .  .  +  r»,  Xt-  ^  Xy,  i  <  j  {i,  j  =  1,  .  .  .  ,  h  +  1),  \h+i(n)  =  0.  Let  T  be  a 
matrix  satisfying  (7.12)  and  (7.15)  where  A  is  the  limit  of  (6.1),  and  satisfying  other 
restrictions  to  make  T  uniquely  defined.  Let  rn  =  hfi (»),...  ,  Y p(n)\  be  com¬ 
posed  of  vectors  satisfying  (2.2)  and  (2.3)  and  the  additional  restrictions  on  T  ( for  n 

sufficiently  large).  Let  @n  be  defined  by  (6.4).  Let  C£  =  V*  -t~  Y*,  where 


(7.25) 


r~  V  =  v 

*  w  w  n  w  n 


( Vx  (») 


0 


0 

V2(n) 


0 

0 


0  0  .  . .  Vh+i («) J , 


V'iin)Vi{n)  =  /,  the  elements  of  the  first  row  of  Vi(n )  are  nonnegative  and 
Vi(n)Yu(n )  =  Tt'»(«)Ti(w),  w/rere  Yu(n)  is  the  i-th  diagonal  submatrix  of  r^F*  =  Yn. 
As  n—*  °° ;  the  limiting  distribution  of  0„,  V*  and  Y*  is  that  of  0,  rF  and  Y*,  simi¬ 
larly  partitioned,  which  may  be  described  as  follows’.  The  marginal  distribution  of  0 
and  V  is  such  that  0„  F,  is  independent  of  0y,  Fy  (i  5^  j) ;  the  marginal  distribution 
of  0i,  Vi  is  that  of  0,  W  given  in  theorem  4  with  p  =  ri}  i  h  4-  1 ;  the  distribution 
of  0ft+i,  Vh+ 1  is  that  of  0,  W  given  in  theorem  3  with  p  —  rh+ 1,  m  =  q2  —  p  +  rh+ 1; 
the  conditional  distribution  of  Y*  given  0,  F  w  normal;  the  conditional  expectation  of 
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a  submatrix  Y*j  is  given  by  (7.19);  the  conditional  covariance  matrix  of  y*  and  y*', 
fi  +  .  .  .  +  r,--i  +  1  ^  gj  ^  rx  +  .  .  .  +  rlf  is 

(7-26)  [r,i;/,)i;„(,')  r'+  M\i\]  +  r/r’i; 


T*1  (Xi-Xy) 

jVt 


conditional  covariance  between  y*  and  y*',  rx  .  .  .  +  r»-i  +  1  ^  g  ^ 
ri  +  .  .  .  +  r,-,  ri  +  .  .  .  +  ry_x  +  1  ^  ^  ri  +  •  •  •  +r;  (*  5*  i),  is 


(7.27) 


X*  +  Xy  +  XtXy  _  (>)  (») ' 

I  jVf  Vg  I  i 


(X.-X,)2 

w/fcre  I\  is  defined  in  (7.22)  and  v^l)  is  the  g-th  column  of  Vi. 

A  special  case  of  considerable  interest  is  the  case  of  r,  =  1,  i  h  +  1  and 
=  p  —  h.  Then  Yi}  consists  of  one  element  for  i,  j  5*  h  +  1.  In  this  case 
Vi  =  1  for  /  5^  h  +  1.  We  can  easily  express  the  conditional  distribution  of  Y 
given  Vh+i  by  integrating  out  di,  .  .  .  ,  dh.  The  marginal  distribution  of  0,-  is 
iV(0,  2X2  +  4Xt)  and  the  conditional  distribution  of  yu  is  iV[0i/(2Xj  +  4), 
l/(Xi  +  2)].  di  and  y„  are  independent  of  the  other  variables.  The  marginal  dis¬ 
tribution  then  of  yu  is  iV(0,  1/2). 

In  this  case  £{Y* |  Vh+ 1}  =  0.  The  conditional  covariance  between  y*  and  y*', 
a  ^  h  is 

(7.28)  cylyl'  =  ir.r;  +  £  r,r, . 

7V« 

The  conditional  covariance  between  y*  and  y**,  a  5*  a*,  a,  a*  ^  h  is 

(7.29) 

The  conditional  covariance  between  y*  and  y**,  a  ^  h,  h  +  1  ^  a*  S  P,  is 

(7.30) 

A a 

The  conditional  covariance  between  y*  and  y*»,  h  +  l  ^  a,  a*  ^  p,  is 

(7.31)  Q  { yaya*  \  Vh+i }  =  irWi  (/ baa*  +  t'iA+l)  )  rA+i  +  8aa* . 

7=1 

Clearly,  if  =  1,  i  =  1,  .  .  .  ,  h  +  1  =  p,  then  Vn  =  /,  and  the  limiting  dis¬ 
tribution  of  ©„  and  Y*  is  such  that  the  marginal  distribution  of  Y*  is  normal 
with  mean  0  and  covariances  derived  from  above. 


C  ( vlyt  1  r,+,  I  -  -  r„rl 

^  Aa  —  ha*) 


£  { y*y*a*  I  vk+i }  =  J-  rh+1Va*+1)ra . 

A, n 


8.  Remarks 

8.1.  Use  of  N  and  n  =  N  —  q.  In  section  2  we  defined  Qn  as  —  Q„.  The  asymptotic 
distributions  obtained  are  exactly  the  same  if  one  uses  Q„  for  the  roots  of 

1 82^:  Qn B2  —  p2S|  are  multiples  by  n/N  of  the  roots  of  (2.1).  These  roots  con¬ 
verge  also  to  Xi,  .  .  .  ,  X/,-1 1,  respectively,  and  for  each  n  the  multiplicities  are  the 
same  in  the  two  cases.  Using  —  Qn  changes  the  definition  of  the  sample  roots  again 


130 


SECOND  BERKELEY  SYMPOSIUM :  ANDERSON 


by  a  factor  of  n/N.  We  might  also  define  A*  in  terms  of  N  instead  of  n  and  nor¬ 
malized  c*  in  terms  of  N  instead  of  n.  Asymptotically  the  effect  of  using  N  instead 
of  n  disappears.  Rubin’s  theorem  can  be  used  to  prove  each  such  statement  rigor¬ 
ously. 

8.2.  The  limiting  probabilities.  It  is  interesting  to  see  for  what  sequences  of  sets 
in  the  space  of  the  characteristic  vectors  the  limiting  probabilities  are  defined.  As 
a  simple  example,  suppose  p  =  r\  —  2  and  £  =7.  We  shall  consider  a  sequence 
of  sets  for  one  vector  c\  and  another  sequence  for  c2  defined  in  the  same  plane. 
Consider  a  segment  on  the  unit  circle  in  the  right  half  plane.  The  regions  for  ci  con¬ 
tain  this  segment  and  as  n  — >  oo  the  regions  converge  to  the  segment.  The  bound¬ 
aries  converge  as  1  /y/n.  Consider  the  segment  of  the  unit  circle  in  the  right  half 
plane  composed  of  the  points  which  are  90°  from  the  points  in  the  other  segment. 
There  is  a  corresponding  sequence  of  regions  which  close  down  on  this  segment  as 
n  increases.  The  limiting  probabilities  are  defined  for  these  sequences. 

8.3.  Cases  of  special  interest.  Two  cases  of  the  model  discussed  here  are  of  par¬ 
ticular  interest.  The  one  occurs  when  the  “fixed  variate”  vectors  za  (in  section  1) 
are  composed  of  dummy  variates;  that  is,  variates  that  are  0  or  1  (see  [1],  for  ex¬ 
ample).  These  can  be  chosen  so  that  B za  —  in,  a  =  Ni  +  .  .  .  +  iV*_i  +  1,  .  .  .  , 
Ni  +  .  .  .  +  Ni  (Ni  -j-  .  .  .  +  Nq  =  N).  The  first  Ni  xa  are  observations  from 
the  first  population,  etc.  If  we  require  that  Ni  =  kiN  as  N  — >  °° ,  then  the  multi¬ 
plicities  of  the  roots  of  (2.1)  are  unchanged  as  N  — >  <» . 

It  can  be  shown  that  if  \Zn[rl(n)  —  r)(n)]  — »  0  as  n—>  °°,ri +  ...-}-  r,_i  -f- 
1  g  g,  f  ^  fi  +  .  .  .  +  r«,  i  =  1,  .  .  .  ,  h  +  1,  then  ©„,  Wn,  and  Zn  (also  0n,  V*, 
and  Y*)  can  be  defined  in  the  manner  of  this  paper  and  have  the  same  limiting 
distribution  as  given  here.  Thus  we  can  weaken  our  conditions  slightly.  If  = 
kiN  to  within  rounding  error  in  the  case  mentioned  above,  the  same  theory  applies. 

If  the  fixed  variate  subvectors  z2a  are  not  composed  of  dummy  variates,  in  gen¬ 
eral  the  nonzero  roots  of  (2.1)  will  be  simple  for  all  n  and  in  the  limit.  Then  the 
theory  at  the  end  of  section  7  applies. 
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